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Abstract. Power circuits are data structures which support efficient algorithms 
for highly compressed integers. Using this new data structure it has been shown 
recently by Myasnikov, Ushakov and Won that the Word Problem of the one- 
relator Baumslag group is in P. Before that the best known upper bound has been 
non-elementary. In the present paper we provide new results for power circuits 
and we give new applications in algorithmic algebra and algorithmic group the- 
ory: 1. We define a modified reduction procedure on power circuits which runs 
in quadratic time thereby improving the known cubic time complexity. The im- 
provement is crucial for our other results. 2. We improve the complexity of the 
Word Problem for the Baumslag group to cubic time thereby providing the first 
practical algorithm for that problem. 3. The main result is that the Word Problem 
of Higman's group is decidable in polynomial time. The situation for Higman's 
group is more complicated than for the Baumslag group and forced us to advance 
the theory of power circuits. 
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Introduction 

Power circuits have been introduced in 11201 . It is a data structure for integers which supports +, 
— , <, a restricted version of multiplication, and raising to the power of 2. Thus, by iteration it 
is possible to represent (huge) values involving the tower function by very small circuits. An- 
other way to say this is that efficient algorithms for power circuits yield efficient algorithms for 
arithmetic with integers in highly compressed form. This idea of efficient algorithms for highly 
compressed data is the main underlying theme of the present paper. In this sense our paper is 
more about compression and data structures than about algorithmic group theory. However, the 
applications are in this area so far. 

Indeed as a first application of power circuits, 121] showed that the Word Problem of the 
Baumslag group 3 is solvable in polynomial time. Algorithmic interests have a long history in 
combinatorial group theory. In 1910 Max Dehn Q formulated fundamental algorithmic problems 
for (finitely presented) groups. The most prominent one is the Word Problem: "Given a finite 
presentation of some fixed group G, decide whether an input word w represents the trivial element 
la in G." It took more than four decades until Novikov and Boone showed (independently) in 
the 1950's the existence of a fixed finitely presented group with an undecidable Word Problem, 
[ 22 3 1 . It is also true that there are finitely presented groups with a decidable Word Problem but 



Sometimes called Baumslag-Gersten group, e.g. in 1231 or in a preliminary version of Ell. 



with arbitrarily high complexity, ||24| Theorem 1.3]. In these examples the difficult instances for 
the Word Problem are extremely sparse, (because they encode Turing machine computations) 
and, inherently due to the constructions, these groups never appear in any natural setting. 

In contrast, the Baumslag group G(i,2) is given by a single defining relation, see Sect. [4] (It 
is a non-cyclic one-relator group all of whose finite factor groups are cyclic IT).) It has been a 
natural (and simplest) candidate for a group with a non-polynomial Word Problem in the worst 
case, because the Dehn function 4 of G(i,2) is non-elementary by a result due to Gersten 11101 . see 
also 1231 . Moreover, the only general way to solve the word problem in one-relator groups is by 
a Magnus break-down procedure M19I171 which computes normal forms. It was developed in the 
1930s and there is no progress ever since. Its time-complexity on Gn t 2) is non-elementary, since 
it cannot be bounded by any tower of exponents. 

So, the question of algorithmic hardness of the Word Problem in one-relator groups is still 
wide open. Some researchers conjecture it is polynomial (even quadratic, see J2]), based on ob- 
servations on generic-case complexity |]13| . Others conjecture that it cannot be polynomial, based 
on the fast growing Dehn functions. (Note that the Dehn function gives a lot information about 
the group. E.g., if it is linear, then the group is hyperbolic, and the Word Problem is linear. If it is 
computable, then the Word Problem is decidable 1181 .) 

The contributions of the present paper are as follows: In a first part, we give new efficient 
manipulations of the data structure of power circuits. Concretely, we define a new reduction 
procedure called ExtendTree on power circuits. It improves the complexity of the reduction 
algorithm from cubic to quadratic time. This is our first result. It turns out to be essential, because 
reduction is a fundamental tool and applied as a black-box operation frequently. For example, 
with the help of a better reduction algorithm (and some other ideas) we can improve as our 
second result the complexity of the Word Problem in Gm 2) significantly from 0(n 7 ) in 1211 
down to C(n 3 ), Thm.|3] This cubic algorithm is the first practical algorithm which works for that 
problem on all reasonably short instances. 

The basic structure in our paper is the domain of rational numbers, where nominators are 
restricted to powers of two. Thus, we are working in the ring Z[l/2]. We view Z[l/2] as an 
Abelian group where multiplication with 1 /2 is an automorphism. This in turn can be embedded 
into a semi-direct product Z[l/2] x Z which is the set of pairs (r, k) 6 Z[l/2] x Z with the 
multiplication (r, k) (s, £) = (r + 2 k s, k + £) . There is a natural partially defined swap operation 
which interchanges the first and second component. Semi-direct products (or more generally, 
wreath products) appear in various places as basic mathematical objects, and this makes the 
algebra Z[l/2] x Z with swapping interesting in its own right. This algebra has a Word Problem 
in 0{n i ) (Thm. [2j. The Word Problem of Gn t 2) can be understood as a special case with the 
better C(n 3 ) performance. 

Another new application of power circuits shows that the Word Problem in Higman's group 
Hi is decidable in polynomial time. (We could also consider any H q with q > 4.) This is our third 
and main result. Higman 1121 constructed Hi in 1951 as the first example of a finitely presented 
infinite group where all finite quotient groups are trivial. This leads immediately to a finitely 
generated infinite simple quotient group of H4; and no such group was known before Higman's 
construction. The group H4 is constructed by a few operations involving amalgamation (see e.g. 
[26 1). Hence, a Magnus break-down procedure (for amalgamated products) yields decidability of 
the Word Problem. The procedure computes normal forms, but the length of normal forms can 
be a tower function in the input length. (More accurately, one can show that the Dehn function of 
Hi has an order of magnitude as a tower function j4j.) Thus, Higman's group has been another 
natural, but rather complicated candidate for a finitely presented group with an extremely hard 

4 We do not use any result about Dehn functions here, and we refer the interested reader e.g. to 
Wikipedia (or to the appendix) for a formal definition. 
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Word Problem. Our paper eliminates H4 as a candidate: We show that the Word Problem of H4 
is in C(n 6 ) (Thm.gJ. 

We obtain this result by new techniques for efficient manipulations of multiple markings in 
a single power circuit and their ability for huge compression rates. Compression techniques have 
been applied elsewhere for solving word problems, [ 14. 15.25 1 1 1. But in these papers the authors 
use straight-line programs whose compression rates are far too small (at best exponential) to cope 
with Baumslag or Higman groups. 

Due to lack of space some few proofs are shifted to the appendix. 



1 Notation and preliminaries 

Algorithms and (decision) problems are classified by their time complexity on a random-access 
machine (RAM). Frequently we use the notion of amortized analysis with respect to a potential 
function, see e.g. in (6) Sect. 17.3]. 

The tower function r : N — > 2 N is defined as usual: r(0) = 1 and r(i + 1) = 2 r ^ for 

i > 0. Thus, e.g. r(4) = 2 2 = 2 16 and r(6) written in binary requires more bits than there 
are supposed to be electrons in this universe. 

We use standard notation and facts from group theory as the reader can find in the classical 
text book 1171 . In particular, we apply the standard (so called Magnus break-down) procedure for 
solving the word problem in HNN-extensions and amalgamated products. All HNN-extensions 
and amalgamated products in this paper have an explicit finite presentation. Amalgamated prod- 
ucts are denoted by G *a H where A is a subgroup in G and in H. The formal definition of 
G *a H creates first a disjoint copy H' of H. Then one considers the free product G * H and 
adds defining relations identifying a 6 A with its copy a 1 £ A'. We refer to 1261 for the basic 
facts about Higman groups. 



2 Power circuits 

This section is based on 1201 , but we also provide new material like our treatment of multiple 
markings and improved time complexities. 5 Let r be a set and 5 be a mapping 8 : f x r — > 
{ — 1, 0, +1}. This defines a directed graph (T, A), where r is the set of vertices and the set of 
directed arcs (or edges) is A — {(P, Q) 6 -T x r \ 5(P, Q) 7^ 0} (the support of the mapping 
8). Throughout we assume that (r, A) is a dag (directed acyclic graph). In particular, S(P, P) = 
for all vertices P. 

A marking is a mapping M : r — ¥ {— 1, 0, +1}. We can also think of a marking as a subset 
of r where each element in M has a sign (+ or — ). (Thus, we also speak about a signed subset.) 
Each node P G r is associated in a natural way with a marking, which is called its yl-marking 
Ap and which is defined as follows: 

A P -.r->{-l, 0, +1} , Q h-s- S(P, Q) 

Thus, the marking yip is the signed subset which corresponds to the targets of outgoing arcs from 
P. 

5 In order to keep the paper self-contained and as we use a slightly different notation we give 
full proofs in the appendix. 
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We define the evaluation e(P) of a node (e(M) of a marking resp.) bottom-up in the dag by 
induction: 

e(P) = 2 e{Ap) for a node P, 

e(M) = M(P)e(P) for a marking M. 

p 

Note that leaves evaluate to 1, the evaluation of a marking is a real number, and the evaluation of a 
node P is a positive real number. Thus, s(P) and e(M) are well-defined. We have the following 
nice formula for nodes: log 2 (£(P)) = e(Ap). Therefore we can view the marking Ap as "taking 
the logarithm of P". 

Definition 1. A power circuit is a pair 77 = (P, S) with S : r x r — > { — 1, 0, +1} such that 
(P, A) is a dag as above with the additional property that e(A7) 6 Ifor all markings M. 

We will see later in Cor. |2]that it is possible to check in quadratic time whether or not a dag 
(r, A) is a power circuit. (One checks e(Ap) > for all nodes P, Lem.[6]in the appendix.) 

Example 1. We can represent every integer in the range [— n, n] as the evaluation of some mark- 
ing in a power circuit with node set {Po, ■ ■ ■ , Pi} such that e(Pi) = 2 1 for < i < £ and 
I — [log 2 n \ ■ Thus, we can convert the binary notation of an integer n into a power circuit with 
0(log |n|) vertices and C(((log n|) log log |n|) arcs. 

Example 2. A power circuit can realize tower functions, since a line of n + 1 nodes allows to 
represent r(n) as the evaluation of the last node. 

Sometimes it is convenient to think of a marking M as a formal sum M — 2~2p M(P)P. 
In particular, — M denotes a marking with s(— M) = — e{M). For a marking M we denote by 
a(M) its support, i.e., 

o(M) = {P er \ M(P) /0}Cf. 

We say that M is compact, if we have e(P) / e(Q) / 2e(P) for all P, Q G a{M), P / Q, 
If A7 is compact, then we have s(M) — if and only if a(M) = 0, and we have e(M) > if 
and only if M(P) is positive for the node P having the maximal value in a(M). 

The insertion of a new node CLONE(P) without incoming arcs and with A CLONE ( P ) = Ap is 
called cloning of a node P. It is extended to markings, where CLONE(A7) is obtained by cloning 
all nodes in o(M) and defining M(clone(P)) = M(P) forP G a(M) and M(clone(P)) = 
otherwise. We say that M is a source, if no node in o(M) has any incoming arcs. Note that 
CLONE(M) is always a source. 

If M = Y.p M(P)P and K = E P K(P)P are markings, then M + K = T, P ( M (P) + 
K(P))P is a formal sum where coefficients — 2 and 2 may appear. For M(P) + K(P) = ±2, let 
P' = CLONE(P). We define a marking (M + K)' by putting (M+K)'(P) = (M+K)'(P') = 
±1. In this way we can realize addition (and subtraction) in a power circuit by cloning at most 
\o(M) n a(K)\ nodes. 

Next, consider markings U and X with e(U) = u and e(X) — x such that 112^ G Z (e.g. 
due to x > 0). We obtain a marking V with e(V) = u2 x and |cr(y)| = \a(U)\ as follows. 
First, let V = CLONE(L'') and X' = CLONE(X). Next, introduce additional arcs between all 
P' G ff(V) and Q' G X' with 8{P' , Q') = X'(Q')- Note that the cloning of X avoids double 
arcs from V to X. The cloning of U is not necessary, if U happens to be a source. 

We now introduce an alternative representation for power circuits which allows us to compare 
markings efficiently. The process of tranforming a II into this so-called tree representation is 
referred to as reduction of a power circuit . 
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Definition 2. A tree representation of a power circuit II = (_T, 5) consists of 

i) r as a list [Pi, . . . , P n ] such that e(Pi) < e(Pi+i) for all 1 < i < n, 

ii) a bit vector b(l), . . . , b(n — 1) where b(i) — 1 if and only if2e(Pi) = e(Pi+i), and 

Hi) a ternary tree of height n, where each node has at most three outgoing edges, labeled by 
+1, 0, and — 1. All leaves are at level n, and each leaf represents the marking M : T —¥ 
{ — 1, 0, +1} given by the labels of the unique path of length n from the leaf to the root of 
the tree. These markings must be compact. Furthermore, all markings Ap for P £ r are 
represented by leaves. Finally, for each level in the tree T, we keep a list of the nodes in that 
level. 

If a path from some leaf to the root is labeled (0, +1, —1, 0, +1), then that leaf represents 
the marking P2 — P3 + P5 and we know (due to compactness) e(-p2) < 2e(Ps) < 4e(Ps). The 
amount of memory for storing a tree representation is bounded by 0(|-T| • (number of leaves)). 
Part|nIJ of the definition is only needed inside the procedure ExtendTree, which is explained 
in the appendix. For simplicity the reader might want to ignore it in a first reading and just think 
of a tree representation as the power circuit graph plus an ordering of the nodes and a bit vector 
keeping track of doubles. 

Proposition 1 (|20|). There is a 0(\r\) time algorithm which on input a tree representation A 
of a power circuit and two markings K and M (given as leaves) compares e(K) and e(M). It 
outputs whether the two values are equal and if not, which one of them is larger. In the latter case 
it also tells whether their difference is 1 or > 2. 

Proof. Start at the root of the tree and go down the paths corresponding to K and M in parallel. 
The first pair of different labels on these paths determines the larger value. The check whether 
e(K) = e(M) + 1 is equally easy due to compactness. □ 

Definition 3. Let II = (P, 5) be a power circuit. A chain (of length r) in II is a sequence of 
nodes (Po, Pi, ... , P r ) where e(Pi) = 2 z e(Po) (0 < i < r). A chain is maximal if it is not part 
of a longer chain. The number of maximal chains in II is denoted c(i7). We define the potential 
ofn to be pot(77) = c(i7) ■ \r\. 

The following statement uses amortized time w.r.t. the potential function pot(i7). Note that 
the potential pot(i7) remains bounded by |_Tj 2 since c(i7) < |_T|. 

Theorem 1. The following procedure EXTENDTREE runs in amortized time 0((\r\ + \U\)-\U\): 
Input: A dag II — (PClU, 5), where P and U are disjoint with no arcs pointing from P to U 
and such that (P, S\rxr) is a power circuit in tree representation. The potential is defined by the 
potential of its power circuit-part pot((_T, 5\rxr))- The output of the procedure is "no", if II is 
not a power circuit (because s(P) Zfor some node P). In the other case, the output is a tree 
representation of a power circuit II' = (_T', S') where: 

i) r C r' and S\rxr = <5'|rxr- 

ii) \r'\ < \r\ + 3 \U\ + (c(i7) - c(i7')) 

Hi) For all Q £ U there exists a node Q' £ P' with e(Q) = e(Q'). 

iv) For every marking M in II there exists a marking M' in PI' with e(M') = e(M) and 
\a{M')\ < \a{M)\. 

For a(M) Cfwe can choose M = M' byQ. If some further markings Mi, M2, ■ ■ ■ , M m are 
part of the input (those where a (M) ClU 7^ 0), we need additional amortized time 0(\a(AIi) \ + 
h \a(M m )\ + m ■ (\r\ + \U\)) to find the corresponding M'i,M' 2 , M' m . 
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Corollary 1. There is aO(\T\ ) time procedure MAKETREE that given a (graph representation) 
of a power circuit II = [T,S) computes a tree representation. The number of nodes at most 
triples. 

Proof. This is a special case of Thm.Ewhenf/ = 77. □ 

Corollary 2. The test whether a dag (F, S) defines a power circuit can be done in C(|^~'| 2 )- 

The efficiency of MakeTree is crucial for all our results. In particular, Cor. [T]improves the 
cubic time complexity of 1201 for reduction to quadratic. 

3 Arithmetic in the semi-direct product Z[l/2] x Z 

The basic data structure for this paper deals with the semi-direct product 2[l/2] xZ. Here Z[l/2] 
denotes the ring of rational numbers with denominators in 2 N . Thus, an element in Z[l/2] is a 
rational number r which can be written as r — u2 x with u, x £ Z. We view Z[l/2] as an 
abelian group with addition. Multiplication by 2 defines an automorphism of Z [1/2], and hence 
the semi-direct product Z[l/2] x Z becomes a (non-commutative) group where elements are pairs 
(r, m) £ Z[l/2] x Z and with the following explicit formula for multiplication: 

(r, m) ■ (s, n) = (r + 2 m s, m + n) 

The semi-direct product Z[l/2] x Z is also isomorphic to a group with two generators a and 
t and the defining relation tat^ 1 — a 2 . This group is known as the Baumslag-Solitar group 
BS(1, 2). The isomorphism from BS(1, 2) to Z[l/2] x Z maps a to (1, 0) and t to (0, 1). This 
is a homomorphism due to (0, 1)(1, 0)(0, —1) = (2, 0). It is straightforward to see that it is 
actually bijective. 

We have (r, m) -1 = (— r2~ m , —m) in Z[l/2] x Z, and a sequence of s group operations 
may lead to exponentially large or exponentially small values in the first component. Binary 
representation can cope with these values so there is no real need for power circuits when dealing 
with the group operation, only. 

We equip Z[l/2] x Z with a partially defined swap operation. For (r, m) £ Z x Z C 
Z[l/2] x Z we define swap(r, m) — (m,r). This looks innocent, but note that a sequence of 
2°^' defined operations starting with (1,0) may yield a pair (0, r(n)) where r is the tower 
function. Indeed swap(l, 0) = (0, 1) = (0, r(0)) and 

swap((0, r(n))(l, 0)(0, -r(n)) = swap(r(n + 1), 0) = (0, r{n + 1)). (1) 

However, we will show in section[6] 

Theorem 2. The Word Problem of the algebra Z[l/2] x Z with swapping is decidable in 0(n 4 ). 

We use triples to denote elements in Z [1/2] x Z. A triple [it, x, k] with u,x,k £ Z and 
x < < k denotes the pair (u2 x ,k + x) £ Z[l/2] x Z. For each element in Z[l /2] x Z there are 
infinitely many corresponding triples. Using the generators a and t of BS(1, 2) one can write: 

[u, x, k] = (u2 x ,k + x) = (0, x)(u, k) £ Z[l/2] x Z 

= t x a u t k £ BS(1,2) and 

[u, x, k] ■ [v, y, t] = [u2~ v + v2 k , x + y,k + £] 

In the following we use power circuits with triple markings for elements in Z[l/2] x Z. We 
consider T = [U, X, K], where U, X, K are markings in a power circuit with e(U) = it and 
e{X) = x < < e(K) = k; and we define e(T) £ Z[l/2] x Z to be the triple e(T) = 
[u, x, k] — (u2 x ,x + k). 
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4 Solving the Word Problem in the Baumslag group 



The Baumslag group G(i,2) is a one-relator group with two generators a and b and the denning 
relation a a = a 2 . (The notation g h means conjugation, here g h — hghT x . Hence a a = 
bab~ aba~ b^ 1 . ) The groupG(i ; 2) canbe written as an HNN extension of BS(1, 2) ~Z[l/2]x 
Z with stable letter b; and BS(1, 2) is an HNN extension of Z ~ (a) with stable letter t: 

(a, 6 | a a = a 2 ) ~ (a, i, | a* = a 2 , a 6 = t) 

~ HNN((a, t | a' = a 2 ),b, (a) ~ (t» 

~ HNN (HNN((a),t, (a) ~ (a 2 )),b, (a) ~ (t)) 

Before the work of Myasnikov, Ushakov and Won ( 11211 ) G(i,2) had been a possible candidate 
for a one-relator group with an extremely hard (non-elementary) word problem in the worst case 
by the result of Gersten 11101 . (Indeed, the tower function is visible as follows: Let T(0) = t and 
T(n + 1) = 6T(n)aT(n) _1 6" 1 . Then T(n) = t T(n) by a translation of Eq.Q]) The purpose 
of this section is to improve the 0(n 7 ) time-estimation of 1211 to cubic time. Thm.|3]yields also 
the first practical algorithm to solve the Word Problem in the Baumslag group for a worst-case 
scenario 6 . 

Theorem 3. The Word Problem of the Baumslag group G(i,2) > s decidable in time 0(n 3 ). 

Proof. We assume that the input is already in compressed form given by a sequence of letters 6 
and pairwise disjoint power circuits each of them with a triple marking [U, X, K] representing 
an element in Z[l/2] x Z, which in turn encodes a word over a ±:L 's and t 's. 
We use the following invariants: 

i) U, X, K have pairwise disjoint supports. 

ii) U is a source. 

iii) All incoming arcs to X U K have their origin in U. 

iv) Arcs from U to X have the opposite sign of the corresponding node-sign in X. 

These are clearly satisfied in case we start with a sequence of a ±1? s, t s, and fe^'s. The 
formula [it, x, k] ■ [v, y, £\ = [u2~ y + v2 k , x + y, k + £\ allows to multiply elements in Z[l/2] x Z 
without destroying the invariants or increasing the total number of nodes in the power circuits 
(the invariants make sure that cloning is not necessary). The total number of multiplications is 
bounded by n. Taking into account that there are at most n 2 arcs, we are within the time bound 
0(n 3 ). 

Now we perform from left-to-right Britton reductions, see [1 171 . In terms of group generators 
this means to replace factors &a"6 _1 by t s and b~ 1 t s b by a" . Thus, if we see a subsequence 
6[it, x, k]b~ L , then we must check if x + k = and after that if u2 x G Z. If we see a subsequence 
b^ 1 [it, x, k]b, then we must check it = 0. In the positive case we swap, in the other case we 
do nothing. Let us give the details: For a test we compute a tree representation of the circuit 
using MakeTree which takes time 0(n 2 ). After each test for a Britton reduction, the tree 
representation is deleted. There are two possibilities for necessary tests. 

1.) it = 0. If yes, remove in the original power circuit the source U, this makes X U K a source; 
replace [u, x, k] by [x + k, 0, 0]. The invariants are satisfied. 

6 It is easy to design simple algorithms which perform extremely well on random inputs. But for 
all these algorithms fail on short instances, e.g. in showing tT(6) = T(6)t. 
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2.) x + k = 0. If yes, check whether u2 x G Z. If yes, replace [u, x, k] in the original power 
circuit by either [0, u2 x , 0] or [0, 0, u2 x ] depending on whether u2 x is negative or positive. 
We get u2 x without increasing the number of nodes, since arcs from U to X have the opposite 
signs of the node-signs in X. Thus, if E has been the set of arcs before the test, it is switched 
to U x X \ E after the test. The new marking for u2 x is a source and does not introduce any 
cycle, because its support is still the support of the source U. 

It is easy to see that computing a Britton reduction on an input sequence of size n, we need at 
most 2n tests and at most n of them are successful. Hence we are still within the time bound 

0(n 3 ). 

At the end we have computed in time 0(n 3 ) a Britton-reduced normal form where inner 
parts (i.e. the ones not involving 6 ±1? s) are given as disjoint power circuits. The result follows 
straightforwardly. □ 



5 Higman groups 

The Higman group H q has a finite presentation with generators oi , . . . , a q and defining relations 
a p a p ~ia p 1 = a p _ 1 for all p G Z/qZ, From now on we interpret indices p for generators a p as 
elements of Z/qZ. In particular, a q = ao and one of the defining relations says a^aqO^ 1 = a q . 
It is known 1 26 1 that H q is trivial for q < 3 and infinite for q > 4. Hence, in the following we 
assume q > 4. The group H4 was the first example of a finitely generated group where all finite 
quotient groups are trivial. It has been another potential natural candidate for a group with an 
extremely hard (non-elementary) word problem in the worst case. Indeed, define: 

w(p, 0) = a p forp G Z/qZ 

w(p — 1, i + 1) = w(p, i)a p -iw(p, i)^ 1 for i G N and G Z/qZ 

By induction, w(p, n) = oj'"' G H q , where r{n) is the n-th value of the tower function, but the 
length of the words w(p, n) is 2 n+1 — 1, only. Hence there is a "tower-sized gap" between input 
length and length of a canonical normal form. 7 

For i,j G N with i < j we define the group Gj , . . . , j by the generators a, , a j+i , . . . , a-j G 
{ai , . . . ,a q }, and defining relations a p a p -ia p 1 = a^,-\ for alii < p < j. Note that each Gi ~ 
Z is the infinite cyclic group. The group G\... q is not H q because the relation aidqa^ 1 = a q is 
missing, but H q is a (proper) quotient of G\,.. q . The groups G^+i are. by the very definition, 
isomorphic to the Baumslag-Solitar group BS(1,2), hence Gi^+i ~ Z[l/2] x Z. It is also 
clear that Gi,...j+i ~ Gi,...,j *Gj Gjj+i for j — i < q. Thus, G123 — G12 *g 2 G2,3 and 

G341 — G34 *G 4 G41. 

For simplicity we deal with q — 4 only. The free group F13, generated by ai and 03 is a 
subgroup of G123 as well as a subgroup of G341 , see e.g. 1261 . Thus we can build the amalgamated 
product G123 *f 13 G341 and a straightforward calculation shows 

H4 ~ G123 *f 13 G341. 

This isomorphism yields a direct proof that H4 is an infinite group, see 1261 . In the following 
we use the following well-known facts about amalgamated products, see I17I26I8I . The idea is to 
calculate an alternating sequence of group elements from G123 and G341 . The sequence can be 
shortened, only if one factor appears to be in the subgroup F13. In this case we swap the factor 

7 This can be made more precise (and rigorous) by saying that the Dehn-function of H4 grows 
like a tower function (l4l) 
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from G123 to G341 and vice versa. By abuse of language we call this procedure again a Britton 
reduction. (This is perhaps no standard notation in combinatorial group theory, but it conveniently 
unifies the same phenomenon in amalgamated products and HNN-extensions; and the notion of 
Britton reduction generalizes nicely to fundamental groups of graphs of groups.) Elements in the 
groups d.i+i are represented by triple markings T = [U, X, K] in some power circuit. In order 
to remember that we evaluate T in the group d.i+i, we give each T a type (i, i + 1), which is 
denoted as a subscript. For e(T) = [u, x, k] we obtain: 

e(2(i,i+i)) = a i+i a i a i+i 6 Gi,i+i 

= (Zj <lj + l 11 UZ £ & 

The following basic operations are defined for all indices i, i + 1, i + 2 and i € Z/4Z, but 
for better readability we just use indices 1, 2, and 3. 

- Multiplication: 

[u,x,k] (h2 ) ■ [v,y,l](i, 2 ) = [u2~ y + v2 k ,x + y,k + £] (iay (2) 

- Swapping from (1, 2) to (2, 3): 

[0,a;,fc] ( i,2) = [z + fc,0,0](2,3)- 0) 

- Swapping from (2, 3) to (1, 2): 

M,0] (2 ,3) = [0,0,21(1,3) for 2 > 0. (4) 

M,0] (2 ,3) = [0,2,0] (1 , 2 ) for 2 < 0. (5) 

- Splitting: 

[u, as, A] (i )2 ) = [i^*, 0,0] (1,2) ■ [0,ar, fc](i, 2 ) for uT 6 Z. (6) 

[w»as,A](2,3) = [0,a;,fe](2,3) • [u2~ k , 0, 0] (2 , 3 ) forit2 _fc e Z. (7) 

From now on we work with a single power circuit II together with a sequence Tj (j £ J) of 
triple markings of various types. This is given as a tuple T = (r, 8; (Tj)j e j). We allow splitting 
operations only in combination with a multiplication, thus we never increase the number of triple 
markings inside T. A tuple T = [T, 5; where (F, S) is in tree representation is called 

a main data structure . We keep T as a main data structure by doing addition and multiplication 
by powers of 2 using clones and calling ExtendTree on these after each basic operation. 

Definition 4. The weight cj(T) of a triple marking T — [U, X, K] is defined as 

u;(T) = \a(U)\ + \a(X)\ + \a(K)\. 

The weight u(T) of a main data structure T is defined as oj(T) = j 
to size 8 ||T|| is defined by \\T\\ = |T|. 

Proposition 2. Let T = {T, S; (Tj)jej) be a main data structure of size at most m, weight at 
most w (and with \ J\ + w < m). The following assertions hold. 

i) No basic operation increases the weight ofT. 

ii) Each basic operation increases the size ||T|| by 0(w + (c(i7) — c(i7))). 
Hi) Each basic operation takes amortized time O(mvo). 

8 The definition is justified, since we ensure J\ + ui(T) < ||7~|| whenever arguing about ||T||. 
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iv) A sequence of s basic operations takes time 0(smw + m 2 ) and the size of T remains 
bounded by 0(m + sw). 

Proof. Applying a basic operation means replacing the left-hand side of the equation by the right- 
hand side, thus forgetting any markings of the replaced triple(s). We can do the necessary tests, 
because we have a tree representation. For an operation we clone the involved markings, but this 
does not increase the weight. Note that there is time enough to create the clones with all their 
outgoing arcs. This yields the increase in the size by 0(w). With the new clones we can perform 
the operations by using the algorithms described in section [2] on the graph representation of the 
circuit. We regain the main data structure by calling ExtendTree which integrates the modified 
clones into the tree representation. 

In order to get[Iv) we observe that the initial number of maximal chains is m and there are at 
most 0(w) new ones created in each basic operation. Hence the total increase in size is O(sw) 
and the difference in potential is at most m(m + sw). The time bound follows. □ 

5.1 Solving the Word Problem in Higman's group 

Theorem 4. The Word Problem of H4 can be solved in time 0(?i 6 ). 

The rest of this section is devoted to the proof of Thm.|4] For solving the word problem in the 
Higman group H4 the traditional input is a word over generators a p 1 . We solve a slightly more 
general problem by assuming that the input consists of a single power circuit 77 = (r, S) together 
with a sequence of s triple markings of various types. Each triple marking [U, X, K]( PtP+1 ) 

corresponds to 4 fly+i e Hi. 

Let us fix w to be the total weight of T = (F, 5; (Tj)i<j< s ). For simplicity we assume 
s < w and that w and sizes of clones are bounded by ||T|| = \r\. (This is actually not necessary, 
but it simplifies some bookkeeping.) Having s < w < n € 0(w), we can think of n = \F\ as 
our input size. We transform the input T = (r, 5; (Tj)i<j< s ) into a main data structure by a call 
of MakeTree. 

During the procedure \r\ increases, but the number of triple markings remains bounded by s 
and the weight remains bounded by w. 

In order to achieve our main result we show how to solve the word problem with 0(s 2 ) 
basic operations on the main data structure T. Assume we have shown this. Then, by Prop. [2] 
the final size will be bounded by m £ 0(s 2 w); and the time for all basic operations is therefore 
0(s 4 W 2 ) C 0(n 6 ). 

We collect sequences of triple markings of type (1, 2) and (2, 3) in intervals £, which in 
turn receive type (1, 2, 3); and we collect triple markings of type (3, 4) and (4, 1) in intervals of 
type (3, 4, 1). Each interval has (as a sequence of triple markings) a semantics e(£) which is a 
group element either in G123 or in G341 depending on the type of £. Thus, it makes sense to ask 
whether e(£) G Fn. These tests are crucial and dominate the runtime of the algorithm. 

Now the sequence {Tj)i<j< s of triple markings appears as a sequence of intervals: 

(£1, . . . , £/;£/ + i, . . . , £t). 

We introduce a separator ";" dividing the list in two parts. 
The following invariants are kept up: 

i) All £1, . . . , £/ satisfy e(£j) ^ F13. In particular, these intervals are not empty and they 
represent non-trivial group elements in (G123 U G341) \ F13. 

ii) The types of intervals left of the separator are alternating. 



10 



In the beginning each interval consists of exactly one triple marking, thus / = and t = s. 
The algorithm will stop either with 1 < / = t or with / = and t = 1. 

Now we describe how to move forward: Assume first / = 0. (Thus, t > 1.) If e(jCi) $5 -F13, 
then move the separator to the right, i.e. we obtain / = 1. If e(Ci) G -F13, then, after possibly 
swapping Ci, we join the intervals C\ and C2 into one new interval. In this case we still have 
/ = 0, but t decreases by 1 . 

From now on we may assume that < / < t. If Cf and have the same type, then 

append to Cf, and move the separator to the left of Cf. Thus, the values / and t decrease 
by 1. 

If Cf and Cf+i have different types, then we test whether or not e(Cf + i) G F13. If 
e(Cf+i) (ji F13, then move the separator to the right, i.e. t — f decreases by 1. If G F13, 

then we swap and join the intervals Cf and into one new interval. Since we do not 

know whether the new interval belongs to F13, we put the separator in front of it, decreasing both 
/ and t by 1 . 

We have to give an interpretation of the output of this algorithm. Consider the case that we 
terminate with 1 < / = t. Then e(Ci) ■ ■ ■ e(Ct) G H4 is a Britton-reduced sequence in the 
amalgamated product. It represents a non-trivial group element, because t > 1. 

In the other case we terminate with / = and t = 1, We will make sure that the test 
"e(C) G -F13?" can as a by-product also answer the question whether or not e(C) is the trivial 
group element. If we do so, one more test on (Ci) yields the answer we need. 

Now, we analyze the time complexity. Termination is clear once we have explained how to 
implement a test "e(C) G -F13?". Actually, it is obvious that the number of these tests is bounded 
by 2s. Thus, it enough to prove the following claim. 

Lemma 1. Every test "e(C) G F13?" can be realized with O(s) basic operations in the main 
data structure T. The test yields either "no" or it says "yes " with the additional information 
whether or not e(C) is the trivial group element. Moreover, in the "yes" case we can also swap 
the type of C within the same bound on basic operations. 

Proof. Let us assume that C is of type (1, 2, 3), i.e., it contains only triples of types (1, 2) and 
(2, 3). Let s be the length of C. The group G123 is an amalgamated product where F\3 is a free 
subgroup of rank 2, see 1261 for a proof. In a first round we create a sequence of triple markings 

(Ti,...,T t ) 

with t < s such that for 1 < i < t the type of Ti is (1, 2) if and only if the type of Ti+i is (2, 3). 
We can do so by s — t basic multiplications from left-to right without changing the semantics of 
g = e(Tx) ■ ■ ■ e(T t ) G Gi 23 . 

Next, we make this sequence Britton-reduced. Again, we scan from left to right. If we are at 
T — Ti with value [it, x, k] we have to check that either [u, x, fc](i_2) = (0, z) G Z[l/2] xi Z or 
[u, x, fc](2,3) = (z, 0) G Z[l/2] x Z for some integer z G Z. 

For the type (1, 2) we have [u, x, fc](i,2) = (0, z) if and only if u = 0, which in a tree 
representation means that the support of the marking for u is empty. Hence this test is trivial. If 
the test is positive, we can replace [u, x, fe](i 2) by [0, x, fc](i_2) an d we perform a swap to type 
(2, 3). If t > 1 we can recursively perform multiplications with its neighbors, thereby decreasing 
the value t. 

For the type (2, 3) we have [u, x, fc](2,3) = { z , 0) if and only if both k + x = and u2 x G Z. 
Tests are possible in linear time and if successful, we continue as in the precedent case. 

The final steps are more subtle. Let s(Tj) = Qj G G12 U G23. Recall that (g±, . . . , gt) is 
already a Britton-reduced sequence. We have g\ ■ ■ ■ g t G F13 if and only if there is a sequence 
(ho, hi, . . . ,ht) with the following properties: 
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i) ho = h t = 1 and hj G G 2 for all < j < t. 

ii) hj-igj = c^Tij with ^ G Gi U G 3 for all 1 < j < t. 

Assume that such a sequence (ho, hi, . . . , ht) exists. Then we have g'j G Gi if and only if 
9o £ G12. Moreover, whenever gh = g'h' G G123 with g, g' G Gi U G3 and h, h' G G2, then 
g — g' and h = h! . This follows because g'~ 1 g = h'h^ 1 G F13 n G2 = {1}. Thus, the product 
hj-iQj uniquely defines G Gi U G3 and fej G G2, because ho — 1 is fixed. 

The invariant during a computation from left to right is that e(Tj) = hj-igj. We obtain 
e(Tj) — g'jhj by a basic splitting. If no splitting is possible we know that g F13 and we can 
stop. If however a splitting is possible, then we have two cases. If j is the last index (j = t), 
then, in addition, we must have hj = 1. We can test this. If the test fails, we stop with g $5 F13. 
If we are not at the last index we perform a swap. We split, then swap the right hand factor and 
multiply it with the next triple marking, which has the correct type to do so. As our sequence 
has been Britton-reduced the total number of triple markings remains constant. There can be no 
cancelations at this point. Thus, the test gives us the answer to "e(C) G -F13?" using O(s) basic 
operations. In the case e(C) G -F13 we still need to know whether e(C) = 1 G G123. For t > 1 
the answer is "no". It remains to deal with t = 1, But a test whether e([u, x, k]) — 1 just means 
to test both u = and x + k = 0. 

Now, assume we obtain a "yes" answer and we know e(£) G -F13. We do the swapping of 
types from left to right by using only the left factor in a splitting. These are additional s basic 
operations, hence the total number of O(s) did not increase. □ 

6 Conclusion and future research 

The Word Problem is a fundamental problem in algorithmic group theory. In some sense "almost 
all" finitely presented groups are hyperbolic and satisfy a "small cancelation" property, so the 
Word Problem is solvable in linear time! For hyperbolic groups there are also efficient parallel 
algorithms and the Word Problem is in NC 2 , see (5). On the other hand, for many naturally 
defined groups little is known. Among one-relator groups the Baumslag group G(i,2) was sup- 
posed to have the hardest Word Problem. But we have seen that it can be solved in cubic time. 
The method generalizes to the higher Baumslag groups Gf TO?n ) in case that m divides n, but this 
requires more "power circuit machinery" and has not worked out in full details yet, see 1211 . The 
situation for G(2,3) is open and related to questions in number theory. The Higman groups H q 
belong to another family of naturally appearing groups where the Word Problem was expected 
to be non-polynomial. We have seen that the Word Problem in H4 is in C(n 6 ). It easy to see 
that our methods show that the Word Problem in H q is always in P, but to date the exact time 
complexity has not been analyzed for q > 4. 

Baumslag and Higman groups are built up via simple HNN extensions and amalgamated 
products. Many algorithmic problems are open for such constructions, for advances about theories 
of HNN-extensions and amalgamated products we refer to 1161 . 

Another interesting open problem concerns the Word Problem in Hydra groups. Doubled hy- 
dra groups have Ackermannian Dehn functions (9), but still it is possible that their Word Problem 
is solvable in polynomial time. 
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Appendix 



Reduction of power circuits 

In this section we give a full proof of Thm. Q] We start with the observation that due to com- 
pactness of the markings in a tree representation, the leaves are automatically ordered by e-value. 
The leftmost has the smallest and the rightmost has the largest e-value. This easy calculation 
(essentially an argument about binary sums) is left to the reader. 
Next, we establish some operations on tree representations. 

Lemma 2. (Insertion of a new node) Let II = (F, 5) be a power circuit in tree representation 
and M a marking in II given as a leaf in the tree. Then we can in amortized time 0(\F\) insert 
a new node P into the circuit with Ap = M. 

Proof. If a node P with e(P) = 2 e ' A/ ^ already exists, we abort immediately. Otherwise the 
index of the leaf corresponding to the (compact) marking M inside the list of leaves tells us the 
position of P in the sorting of I. (Actually, we have to count the number of leaves of type Aq 
(Q e I) that are left of M.) 

Next, we need to update the bit vector b. This is achieved by the procedure described in 
Prop. Q] which tells us whether e(M) + 1 = e(Aq) where Q is the node succeeding P in the 
ordering of I. 

Finally, we have to "stretch" the tree defined by A by inserting a new level corresponding to 
the new node P. All the edges on that level have to be labeled by "0", as no marking uses the 
newly created node yet. This can be done in linear time using the lists of nodes we keep for each 
level. 

Note that this may increase the potential by \r\, since both \r\ and the number of chains 
might grow by one. This adds |_T| to the amortized time, which is captured by the O-notation. 

□ 

Lemma 3. (Incrementation of a marking) Given a marking M in a tree representation of Ft = 
(r, 5) one can generate in 0(\o(M) ) time a marking M' with e(M') — e(M) + 1. 

Proof. If r is empty, the claim is obvious. Otherwise let P be the unique node in r with e(P) = 
1. If M (P) 7^ +1, we increment M(P) by one and are done. Otherwise, we look for a node P' 
with e(P') = 2. If it doesn't exist (in which case r = {P} and e(M) = 1), we create it and put 
M(P) = and M(P') = +1. If P' does exist, then M(P') = due to compactness. Again, 
put M(P) = and M(P') = +1. 

Note that in the last case the newly created marking is not necessarily compact and therefore 
cannot be inserted immediately into the tree representation as a leaf. We will deal with this in the 
next lemma. □ 

Lemma 4. (Making a marking compact) Let FI — (r, S) be a power circuit in tree representa- 
tion and M be a marking in FI (not yet a leaf and in particular not compact; e.g. given as a list 
of signed pointer to nodes). Assume that for each node P £ a(M) the last node F in the longest 
drain starting in P is not marked by M. Then M can be made compact in time 0(\o(M)\) ( and 
after that integrated into II as a leaf in time O (| r\ ) ) without changing the circuit and without 
increasing the weight of M. 

Proof. We look at the nodes P £ er(M) in ascending order (w.r.t. their value). There are es- 
sentially two ways for M not to be compact at a point P: Assume that M(P) = +1 ( — 1 is 
similar). 
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1. ) P is the first node in a chain length 2 which M labels (+1, —1)- Replace it by ( — 1, 0). 

2. ) P is the first node of a chain (P = Pi,P 2 , . ■ . , Pk,Pk+i) labeled M(P) = +1 (1 < 

i < k) and M(Pk+i) 7^ +1. Note that by assumption Pk+i exists. Replace the labels by 
( — 1, 0, . . . , 0, +1). We might need to repeat this (if there is a node Pk+2 with e(Pk+2) = 
2e(P fe+ i) and M (Pk+2) = +1) but this ultimately stops at T. 

□ 

Often we need to increment a leaf marking by one and make it compact. In order to have the 
necessary nodes, we introduce the following concept: 

Definition 5. Let II — (P,&) be a power circuit in tree representation. A node J is called a 
joker, if it is the last in a maximal chain starting at the unique node with value 1 and J is not 
used in any leaf marking. 

Lemma 5. Let II — (P, S) be a power circuit in tree representation. Then we can in amortized 
time 0(\P\) insert a joker into the circuit. □ 

Proof. Start at the node Po with e(Po) = 1. Using the bit vector, find the first "gap" in the chain 
starting at Po, i.e., the largest i such that Po, . . . , P n -i exist with e(Pj) = 2'. Keep the number 
n in binary notation. Like in Lem.[4] compute a compact representation of n. Note that we are 
dealing with ordinary numbers here, not circuits! Use the compact representation of n to create 
P n with e(P n ) = 2". Check whether there is a node P n +i and adjust the bit vector. If yes, P n 
linked two maximal chains, so in amortized analysis we don't have to account for the 0(|P|) 
time used so far. Repeat the process until we create a node that is the end of a maximal chain. 
This is the joker. □ 

Now we are ready to prove Thm. [TJ 

Proof of Thm.\l\ Let n = |P| + \U\. We may assume n > 1. 

Perform a topological sorting of U, i.e. find an enumeration U = {Q(0), ■ ■ ■ , Q(\U\ — 1)} 
such that there are no arcs from any Q(i) to Q(j) when i < j. Since II is a dag, a topological 
ordering of U exists and it can be found in time 0(n \U\), see e.g. J6j. The nodes of U will be 
moved to P in ascending topological order, so that all the time P remains a circuit, i.e., there are 
no arcs from P to U. 

Let M be any one of the markings Mj (j = 1, . . . , m) or Ap (P £ U). While the nodes of 
U are being moved to P, there may be times when the support of M is partly in P and partly still 
in U . Later it will be completely contained in P. We will maintain the following invariants: 

i) Any marking whose support is completely contained in P is represented as a leaf (and thus 
compact). 

ii) For all other markings M and all nodes P G a(M) n P there is a chain starting at P and 
ending at a node T g' o(M) such that there is node node with double the value of e(T) (i.e. 
the chain cannot be prolonged at the top end). 

We now describe how to do the moving of the topologically smallest node Q of U. 

We have Aq C P, so by the invariant, this is a leaf. Hence it is possible to test whether 
e(Aq) < using Prop.[T] If this is the case, II is not a power circuit and we stop with the output 
"no". From now on we assume e(Q) > 1. 

1 . ) Insert a new joker 

See Lem.|5] Now we don't have to worry about incrementing compact markings anymore. 

2. ) Find a replacement P for Q in P 

Check whether there is a node P € P with value e(P) = e(Q). If not, use Lem.[2]to create 
it taking the marking Aq as Ap. This takes amortized time C(|P|). 
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3. ) Adapt markings using Q: 

Using the bit vector, go up the chain in r starting at P. Prolong the chain at the top by 
creating a new node P' (use Lem.[3]on the successor marking of the last node of the chain, 
insert it into the tree via Lem. |4]and use it as a successor marking for creating the new node 
P' with Lem.|2] The time needed is 0(\r\). 

Go through all markings M (Aqi for Q' € U and Mj for j = 1, . . . , m) that have Q £ 
a(M). Replace Q by P in M. If this leads to a double marking of P by M, replace those by 
the next node in the chain. Again, that node might become doubly marked by M, so repeat 
this. This stops at the latest at P' which is new and thus unmarked M. For each of these steps, 
the support of M decreased by one, so the total time (for all Q £ U) is bounded by the sum 
of the sizes of all supports, i.e., n ■ \U\ for successor markings and |cr(Mi)| + • • • + o"(M m )| 
for the markings Mj (j = 1, . . . , m). Now Q is not part of any marking anymore and can be 
deleted. 

4. ) Make markings compact: 

If Q was the last node of a marking M to be moved from U to r, we have to make M compact 
and create a leaf in the tree. This is done by using Lem. |4] Note that we have the invariant 
Qy. We need time 0(\a(AI)\ + n), which over the whole procedure sums up to 0(n ■ \U\) 
for successor markings and 0(|<j(Mi)| + ■ ■ ■ + |cr(j\/ m )| + m ■ n) for the markings Mj 
(j = 1, .. . ,m). 

5. ) Make room for later compactification: 

Start at P 1 and create a new node T that has value e(T) — 2e(P'). Check whether there 
is a node with value 2e(T). Is yes, the creation of T has linked two maximal chains, thus 
decreasing the potential by This pays for the time needed for creating T and the 

check. Repeat this until we create a T that has no node with double the value of T. Note that 
this takes only amortized time C(|_r|). 

□ 

A more detailed look at power circuits 
Lemma 6. The following assertions are equivalent: 

1. ) e(P) € 2 N for all nodes P, 

2. ) e(A P ) > for all nodes P, 

3. ) e(M) £ Z for all markings M. 

Proof. Choose some node P without incoming arcs which exists because (r, S) defines a dag. 
The assertions are equivalent onf\ {P} by induction. The result now follows easily. 

Proof of Cor.\2\ The procedure MakeTree uses ExtendTree which detects if e(P) g 2 N . 
The result follows from Lem. [6] □ 

The Word Problem of the algebra Z[l/2] x Z with swapping 

Proof of Thm.\2\ The proof is almost the same as the proof of Thm.[3]which has been given above. 
Hence we focus on the differences in the proof. The input is given by a sequence of pairwise 
disjoint power circuits each of them with a triple marking [U, X, K] representing an element in 
Z[l/2] xi Z. We use only the invariant that U, X, K have pairwise disjoint supports. Swapping of 
[u, x, k] is possible, if z = u2 x G Z and the result is either [x + k, z, 0] or [x + k, 0, z\ depending 
on the sign of z. In order to realize a marking for z we clone U. This increases the size by \o{U) \. 
At the end the size of the power circuit is quadratic in the input. This yields 0(n 4 ) time. □ 
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Dehn functions 

No result about Dehn functions is used in our paper. However, for convenience of the interested 
reader we recall the definition of a Dehn function as given by Wikipedia. Let G be given by a 
finite generating set X with a finite defining set of relations R. Let F(X) be the free group with 
basis X and let w £ F(X) be a relation in G, that is, a freely-reduced word such that w — 1 in 
G. Note that this is equivalent to saying that is, w belongs to the normal closure of R in F(X). 
Hence we can write w as a sequence of m words xrx~ 1 with r e R ±x and x € F(X). The area 
of w, denoted Area(«j), is the smallest m > such that there exists such a representation for w 
as the product in F(X) of m conjugates of elements of i?* 1 . Then the Dehn function of a finite 
presentation G = (X \ R) is defined as 

Dehn(n) = max {Area(w) | w = 1 € G, \w\ < n, and w is freely-reduced} . 

Two different finite presentations of the same group are equivalent with respect to domination. 
Consequently, for a finitely presented group the growth type of its Dehn function does not depend 
on the choice of a finite presentation for that group. 

A function / : N -> N is dominated by g : N — s> N, if there exists c > 1 such that 

/(n) < cg(cn + c) + cn + c for all n £ N. 
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